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We investigate the magic number problem, that is, the question whether there exists a minimal «-state 
nondeterministic finite automaton (NFA) whose equivalent minimal deterministic finite automaton 
(DFA) has a states, for all n and a satisfying n < a < 2". A number a not satisfying this condition 
is called a magic number (for n). It was shown in ifTTI that no magic numbers exist for general 
regular languages, while in [5| trivial and non-trivial magic numbers for unary regular languages 
were identified. We obtain similar results for automata accepting subregular languages like, for 
example, combinational languages, star-free, prefix-, suffix-, and infix-closed languages, and prefix-, 
suffix-, and infix-free languages, showing that there are only trivial magic numbers, when they exist. 
For finite languages we obtain some partial results showing that certain numbers are non-magic. 



1 Introduction 

Nondeterministic finite automata (NFAs) are probably best known for being equivalent to right-linear 
context-free grammars and, thus, for capturing the lowest level of the Chomsky-hierarchy, the family of 
regular languages. It is well known that NFAs can offer exponential saving in space compared with de- 
terministic finite automata (DFAs), that is, given some rc-state NFA one can always construct a language 
equivalent DFA with at most 2" states 1231 . This so-called powerset construction turned out to be opti- 
mal, in general. That is, the bound on the number of states is tight in the sense that for an arbitrary n there 
is always some w-state NFA which cannot be simulated by any DFA with less than 2" states irT7ll2~l"ll2"2"Tl . 
On the other hand, there are cases where nondeterminism does not help for the succinct representation 
of a language compared to DFAs. These two milestones from the early days of automata theory form 
part of an extensive list of equally striking problems of NFA related problems, and are a basis of de- 
scriptional complexity. Moreover, they initiated the study of the power of resources and features given 
to finite automata. For recent surveys on descriptional complexity issues of regular languages we refer 
to, for example, HUH El 

Nearly a decade ago a very fundamental question on the well known subset construction was raised 
in Q : Does there always exists a minimal «-state NFA whose equivalent minimal DFA has a states, for 
all n and a with n < a < 2"? A number a not satisfying this condition is called a magic number for n. 
The answer to this simple question turned out not to be so easy. For NFAs over a two-letter alphabet it 
was shown that a = 2" - 2 k or 2 n - 2 k - 1, for < k < n/2 - 2 [9], and a = 2" - k, for 5 < k < 2n - 2 
and some coprimality condition for k |[T0l , are non-magic. In lfT2l it was proven that the integer a is 
non-magic, if n < CC < 1 +n(n + l)/2. This result was improved by showing that a is non-magic for 
n < a < 2^" in lfT3l . Further non-magic numbers for two-letter input alphabet were identified in 
and |fl9l . It turned out that the problem becomes easier if one allows more input letters. In fact, for 
exponentially growing alphabets there are no magic numbers at all lTT2l . This result was improved to 
less growing alphabets in (H, to constant alphabets of size four in [IT], and very recently to three-letter 
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alphabets |fl5l . Magic numbers for unary NFAs were recently studied in (5j by revising the Chrobak 
normal-form for NFAs. In the same paper also a brief historical summary of the magic number problem 
can be found. Further results on the magic number problem (in particular in relation to the operation 
problem on regular languages) can be found, for example, in |[T3l[T4l . 

To our knowledge the magic number problem was not systematically studied for subregular lan- 
guages families (except for unary languages). Several of these subfamilies are well motivated by their 
representations as finite automata or regular expressions: finite languages (are accepted by acyclic fi- 
nite automata), combinational languages (are accepted by automata modeling combinational circuits), 
star-free languages or regular non-counting languages (which can be described by regular-like expres- 
sion using only union, concatenation, and complement), prefix-closed languages (are accepted by au- 
tomata where all states are accepting), suffix-closed (or multiple-entry or fully-initial) languages (are 
accepted by automata where the computation can start in any state), infix-closed languages (are accepted 
by automata where all states are both initial and accepting), suffix-free languages (are accepted by non- 
returning automata, that is, automata where the initial state does not have any in- transition), prefix-free 
languages (are accepted by non-exiting automata, that is, automata where all out-transitions of every 
accepting state go to a rejecting sink state), and infix-free languages (are accepted by non-returning and 
non-exiting automata, where these conditions are necessary, but not sufficient). 

The hierarchy of these and some further subregular language families is well known. We study all 
families mentioned with respect to the magic number problem, and show — except for finite languages, 
where only some partial results will be presented — that there are only trivial magic numbers, whenever 
they exist. 

2 Definitions 

Let E* denote the set of all words over the finite alphabet £. For n>0we write E" for the set of all words 
of length n. The empty word is denoted by A and E + = E* \ {A}. A language L over £ is a subset of E*. 
For the length of a word w we write \w\. Set inclusion is denoted by C and strict set inclusion by C. We 
write 2 s for the power set and \S\ for the cardinality of a set 5. 

A nondeterministic finite automaton (NFA) is a quintuple A = (<2,E, 8,qo,F), where Q is the finite 
set of states, E is the finite set of input symbols, qo G Q is the initial state, F C Q is the set of accepting 
states, and 8 : Q x E — > 2® is the transition function. As usual the transition function is extended to 
8 : Q x E* — > 2@ reflecting sequences of inputs: 8(q,X) = {q} and 8(q,aw) = [j q ' e s(qa) 8(q',w), for 
q G Q, a G E, and w G E*. A word w G E* is accepted by A if 8(qo, w) OF / 0. The language accepted 
by A is L(A) = { w G E* | w is accepted by A }. 

A finite automaton is deterministic (DFA) if and only if \8(q,a)\ = 1, for all q G Q and a G E. In this 
case we simply write 8(q,a) = p for 8{q,a) = {p} assuming that the transition function is a mapping 
8 : Q x E — > Q. So, any DFA is complete, that is, the transition function is total, whereas for NFAs 
it is possible that 8 maps to the empty set. Note that a sink state is counted for DFAs, since they are 
always complete, whereas it is not counted for NFAs, since their transition function may map to the 
empty set. In the sequel we refer to the DFA obtained from an NFA A = (<2,£, 8,qo,F) by the power- 
set construction as A' = (2 e ,L,8',{q },F'), where S'(P,a) = \J peP 8(p,a), for P C Q and a G E, and 
F' = {PC £|PnF/0}. 

As already mentioned in the introduction, in |[TT1l it was shown that for all integers n and a such 
that n < a < 2", there exists an n-state nondeterministic finite automaton A n a whose equivalent minimal 
deterministic finite automaton has exactly a states. Since some of our constructions rely on this proof 
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and for the sake of completeness and readability we briefly recall the sketch of the construction. In the 
following we call the NFAA„ )Ct the Jirasek-Jiraskova-Szabari automaton, or for short the JJS -automaton. 

Theorem 1 ( 1111 ) For all integers n and a such that n<a< 2", there exists an n-state nondeterministic 
finite automaton A n ^ a whose equivalent minimal deterministic finite automaton has exactly a states. 

In the construction for some fixed integer n the cases a = n and a = 2" are treated separately by 
appropriate witness languages. For the remaining cases it is first shown that every a satisfying n < a < 2" 
can be written as a specific sum of powers of two. In particular, for all integers n and a such that 
n < a < 2", there exist integers k and m with 1 < k < n — \ and 1 < m < 2 k , such that 

n- (k+ \)+2 k + m 



and 



a 



in 



(2 kl - 1) + (2 h -!) + ••• + (2**- 1 - 1) + 




where 1 < t < k — 1 and k>k\ > ki > • ■ ■ > kg > 1. Then NFAs are constructed such that the powerset 
construction yields DFAs whose number of states is exactly one of these powers of two, which finally 
have to be combined appropriately to lead to a single Ti-state NFA A n ^ a whose equivalent minimal DFA 
has exactly a states. Automaton A n a is depicted in Figure [T] where the following <i-transitions are not 
shown: 

' {0,2,3,4,..., k-ki + 1} if 1< i<£-l 
{0, 1, . . . , k — ki + 1} if i = I and m is of the first form 
8(i,d) = \ {0,2,3,4, . . . ,k — ki + 1} if i = I and m is of the second form 

{0, 1, . . . , k — ki + 1} if i = £ + 1 and m is of the second form 
otherwise. 




Figure 1: Jirasek-Jiraskova-Szabari's (JJS) nondeterministic finite automaton A n a with n states 
(^-transitions are not shown) accepting a language for which the equivalent minimal DFA needs exactly 
a = n — (k+l) +m states. 



3 Results 



We systematically investigate the magic number problem for the aforementioned subregular language 
families. For the remaining theorems of this paper, when speaking of an n-state NFA we always mean a 
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minimal NFA. Given a subregular language family, if f(n) is the number of states that is sufficient and 
necessary in the worst case for a DFA to accept the language of an n-state NFA belonging to the family, 
then a number a with f(n) < a < 2" is called a trivial magic number. Similarly, if g{n) is the number 
of states that is necessary for any DFA simulating an arbitrary n-state NFA, then all numbers a with 
a < g{n) is also called a trivial magic number. For example, for infix-free languages g(n) is shown to 
be n + 1 in Theorem[5] while f(n) is known to be 2"~ 2 + 2 [ 1 ]. Due to space constraints most proofs are 
omitted. 

An observation from HI shows that the magic number problem for elementary and combinational 
languages is trivial. 

3.1 Star-Free Languages and Power Separating Languages 

A language L C E* is star-free (or regular non-counting) if and only if it can be obtained from the 
elementary languages {a}, for a G £, by applying the Boolean operations union, complementation, and 
concatenation finitely often. These languages are exhaustively studied, for example, in (20J. Since 
regular languages are closed under Boolean operations and concatenation, every star-free language is 
regular. On the other hand, not every regular language is star free. 

Here we use an alternative characterization of star-free languages by so called permutation-free au- 
tomata |[20l : A regular language L C Z* is star-free if and only if the minimal DFA accepting L is 
permutation-free, that is, there is no word w£l* that induces a non-trivial permutation of any subset of 
the set of states. Here a trivial permutation is simply the identity permutation. Observe that a word uw 
induces a non-trivial permutation {q\,q2,- ■ • ,<?«} C Q in a DFA with state set Q and transition function 8 
if and only if wu induces a non-trivial permutation {S(q\ , u), 8(q2, u) , . . . , 8(q n , u)} in the same automa- 
ton. Further, if one finds a non-trivial permutation consisting of multiple disjoint cycles, it suffices to 
consider a single cycle. Before we show that no magic numbers exist for star-free languages we prove a 
useful lemma on permutations in (minimal) DFAs obtained by the powerset construction. 

Lemma 2 Let Abe a nondeterministic finite automaton with state set Q over alphabet £, and assume 
that A' is the equivalent minimal deterministic finite automaton, which is non-permutation-free. If the 
word w in L* induces a non-trivial permutation on the state set {Pq,Pi, . . . ,P n -i} C 2^ of A', that is, 
8'(Pi,w) = Pi + \,for < i < n — 1, and S'(P n -i,w) = Pq, then there are no two states Pi and Pj with i ^ j 
such that Pi C Pj. 

Proof: Assume to the contrary that Pq C P { (possibly after a cyclic shift), for some < i < n — 1. Then 
one can show by induction that 8'{Pq,v) C 8'(Pi,v), for every word v G £*. In particular, this also 
holds true for the word w that induces the non-trivial permutation on the state set {Pq,Pi, . . . But 
then Pki mod n = 8'(Po,w kl ) C 8'(Pi,w h ) = P(k+i)imodn> f° r & > 0, and one finds the chain of inclusions 
Pa C Pi C P 2i mod „ C P 3i mod „ C • • • C P ni mod „ = P Q , which implies P = P t , a contradiction. □ 

Now we are prepared for the main theorem, which utilizes Lemma 12 

Theorem 3 For all integers n and (X such that n < a < 2 n , there exists an n-state nondeterministic finite 
automaton accepting a star-free language whose equivalent minimal deterministic finite automaton has 
exactly a states. 

The previous theorem generalizes to all language families that are a superset of the family of star-free 
languages such as, for example, the family of power separating languages introduced in l25ll . 
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3.2 Stars and Comet Languages 

A language L C E* is a star language if and only if L = H*, for some regular language H C E*, and 
L C E* is a comet language if and only if it can be represented as concatenation G*H of a regular star 
language G* C E* and a regular language H C E*, such that G ^ {A} and G^0. Star languages and 
comet languages were introduced in Q and Q. Next, a language L C E* is a two-sided comet language 
if and only if L = EG*H, for a regular star language G* C E* and regular languages Z?,// C E*, such that 
G 7^ {A} and G / 0. So, (two-sided) comet languages are always infinite. Clearly, every star language not 
equal to {A} is also a comet language and every comet is a two-sided comet language, but the converse 
is not true in general. 

Theorem 4 For all integers n and a such that n < a < 2", there exists an n-state nondeterministic 
finite automaton accepting a star language whose equivalent minimal deterministic finite automaton has 
exactly a states. The statement remains valid for (two-sided) comet languages. 

3.3 Subword Specific Languages 

In this section we consider languages for which for every word in the language either all or none of its 
prefixes, suffixes or infixes belong to the same language. Again, there are only trivial magic numbers. 
We start with subword-free languages. 

A language L C E* is prefix-free if and only if y G L implies yz £ L, for all z G E + , infix-free if and 
only if y G L implies xyz £ L, for all xz G E + , and suffix-free if and only if y G L implies xy L, for all 
*GE+. 

Theorem 5 Let A be a minimal n-state NFA accepting a non-empty prefix-, suffix- or infix-free language. 
Then any equivalent minimal DFA accepting language L(A) needs at least n+ 1 states. 

In the following we show that no non-trivial magic numbers exist for subword-free languages. The 
upper bound for the deterministic blow-up in prefix- and suffix-free languages is 2"~ 1 + 1 and for infix- 
free languages it is 2 + 2, so all numbers above are trivially magic. 

Theorem 6 For all integers n and a such that n < a < 2 n ~ l + 1, there exists an n-state nondetermin- 
istic finite automaton accepting a prefix-free language whose equivalent minimal deterministic finite 
automaton has exactly a states. The statement remains true for NFAs accepting suffix-free languages. 

For infix-free regular languages the situation is slightly different compared to above. 

Theorem 7 For all integers n and a such that n < a < 2 n ~ 2 + 2, there exists an n-state nondetermin- 
istic finite automaton accepting an infix-free language whose equivalent minimal deterministic finite 
automaton has exactly a states. 

Next, we consider prefix-, infix-, and suffix-closed languages. A language L G L* is prefix-closed if 
and only if xy G L implies x G L, for x G E*, infix-closed if and only if xyz G L implies y G L, for x,z G £*, 
and suffix-closed if and only if yz G L implies z G L, for z, G E*. We use the following results from |[T6l . 

Theorem 8 (I) A nonempty regular language is prefix-closed if and only if it is accepted by some non- 
deterministic finite automaton with all states accepting. (2) A nonempty regular language is infix-closed 
if and only if it is accepted by some nondeterministic finite automaton with multiple initial states with all 
states both initial and accepting. 
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Prefix-closed languages reach the upper bound of 2" states, and for infix-closed languages it is 
2«-i _j_ i Tjp t these bounds the only magic number for both language families is n (except for n = 1). 
The upper bound for suffix-closed languages is 2 n_1 + 1, and up to this, no magic numbers exist. 

Theorem 9 For all integers n and a such that n < a < 2 n ~ l + 1, there exists an n-state nondetermin- 
istic finite automaton accepting an infix-closed language whose equivalent minimal deterministic finite 
automaton has exactly a states. The case a = n can only be reached for n=\. 

Proof: For the second statement, note that each DFA accepting a language L / £* needs a non-accepting 
state, which the minimal NFA cannot have, due to Theorem [8] So, Z* is the only infix-closed language, 
for which the size of the minimal DFA equals the size of an equivalent minimal NFA. Both have a single 
state. The case a = 2 n ~ l + 1 is discussed in (T). For the remaining, assume n < a < 2 n ~ l . In this case, 
the JJS-automaton A na = (Q,L,8,n — l,{k}) has a non-empty initial tail of states, that is, the initial 
state is equal to state n— 1. From A„ M we construct an automaton A\ = (Q, £U {#,$}, 8\ , Q, Q) with all 
states initial and accepting and transition function Si (k,#) = {k}, 8i (q, $) = {q — 1} if k + 2 < q < n — 1, 
8\(k+ 1,$) = {1} and 8\{q,a) = 8(q,a) for < q < k and This NFA with multiple initial 

states can be converted into an equivalent NFA A2 with initial state n — 1 and the transition function 
8i{n — I, a) = {J qe Q8\{q,a) and 8i{q,a) = 8\{q,a) for all a G £U {#,$} and q £ Q\{n — I}. 

With Si = { (S'X'^^'c^ 1 ) I < i < n- (jfc+1) }, S 2 = { ($"-(*+ V,^" 1 -'') | 1 < i < k- 1 } and 
S 3 = d,c k )}, one can easily check, that S = Si U S 2 U S3 is a fooling set for L(A 2 ): Different 

pairs from Si result in a word beginning with more than n— (k + 1) $-symbols, pairs from S2 result in 
too many c-symbols, c k from S3 cannot be combined with any other word and mixing pairs from Si 
and S2 either results in a word containing the infix $c ! $ or, if (S" - ^ 1 ),^ -1 ) is chosen from Si, in 
§n-(k+i) c i+k-i ^ w j 1 | c ] 1 enc j s too man y c-symbols. 

In the corresponding powerset automaton A' 2 , by reading prefixes of one reaches n — (k + 1) 

states {n- 1}, {n-2,...,ife+l,l}, {Jk+1,1}. After reading $ r, - ( - k+1 \ A' 2 is in state {1} and from 
there, according to the J JS -construction, 2 k + m states from 2^ ' 1 '" '^ are reachable. So we have exactly a 
states. To see that no further states can be reached, note that the transition function differs from the one 
of the JJS-automaton only in states k+l,...,n — l and state k. The #-transition in state k gives no new 
reachable states and reading $ always leads to either a state {n — i, . . . , k + 1 , 1}, for some 1 <i<n — k+l, 
or to state {1} or the empty set. So, the only interesting transitions are those of the initial state {n — 1} 
on the input symbols a, b, c and d. Reading a or b leads to {0, . . . ,k}, reading c to {1, ... ,k} and on 
input d, A' 2 enters the state 8(q,d) for the largest q € Q for which this transition is defined. All these 
states were already counted. 

To prove that any two distinct states M,iVCg\{n-l} are pairwise inequivalent, without loss of 
generality, pick an element q G M\N. If q < k, the word c k ~ q # distinguishes M and Af. Otherwise, 
if q > k+ 1, one can drive it to state 1 by reading $-symbols, and then c k ~ l distinguishes the two states. 

Finally, state {n — 1} is inequivalent with any state Af C Q \ {n — 1} by the input word $ n ~( k+1 *>c k ~ l . 

□ 

The family of infix-closed languages is a subset of the family of suffix-closed languages, so the 
previous theorem generalizes to the latter language family, except for n which is not magic f or n > 1 
anymore: 

Corollary 10 For all integers n and a such that n < a < 2 n ~ x + 1, there exists an n-state nondetermin- 
istic finite automaton accepting a suffix-closed language whose equivalent minimal deterministic finite 
automaton has exactly a states. 
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Since the upper bound for the deterministic blow-up of prefix-closed languages is greater than that 
of infix-closed languages, we need to treat them separately here. 

Theorem 11 For all integers n and OC such that n < a < 2", there exists an n-state nondeterministic finite 
automaton accepting a prefix-closed language whose equivalent minimal deterministic finite automaton 
has exactly a states. The case a = n can only be reached for n=\. 

3.4 Finite Languages 

For finite languages the magic number problem turns out to be more challenging which seems to coincide 
with the fact, that the upper bounds for the deterministic blow-up of finite languages differ much from 
these of infinite language families. In ll24l it was shown that for each n-state NFA over an alphabet of 
size k, there is an equivalent DFA with at most 0(fc' ! /( 1 °gW+ 1 )) states. This matches an earlier result 
of <9(2"/ 2 ) for finite lan guages over binary alphabets 11181 . 

In this section we give some partial results for finite languages over a binary alphabet, that is, we 
show that a roughly quadratic interval beginning at n + 1 contains only non-magic numbers and that 
numbers of some exponential form 2("~ 1 ^ 2 + 2' are non-magic, too. Note that for finite languages, n is a 
trivial magic number, since any DFA needs a non-accepting sink state which is not necessary for an NFA. 

Theorem 12 For all integers n and CC such that n + 1 < a < (|) 2 + | + 1 if n is even, and 
n+ 1 < a < ( i ^) 2 + n+ 1 ifn is odd, there exists an n-state nondeterministic finite automaton accept- 
ing a finite language over a binary alphabet whose equivalent minimal deterministic finite automaton 
has exactly a states. 

Proof: The case a = n+\ can be seen with the witness language {a,b} n . So, assume n + 1 < CC. Then 
there exist integers k and m such that 

X k 

k = max{ x > | a > 1 + Y\ n — 2/} and m = a — 1 — — 2i. 

i=Q i=0 

Let A = ({l,...,n},{a,b},8, l,{n}) be an NFA with 8(q,a) = {q+ l,2q+ l,2q+2, . .. ,n}, 1 < q < k, 
8(k + I, a) = {(k + 1) + l,n— (m — 1),»— (m — 1) + 1, .. . ,n}, 8(q,a) = {q + 1}, for k+ 1 < q < n, and 
8(q,b) = {q+ 1}, for q < n. 

The transitions on b ensure the minimality of A and the inequivalence of states in the corresponding 
powerset automaton A'. To count all reachable states of A', we partition the set {a,b}* as follows: 

k 

{a,b}* = (J{^a}{a,H*U{^ +1 }{a^}*U{Z7}*. 
i=0 

With words from {b}*, the singletons {1}, . . . , {«} and are reachable — which gives n+l states. Next, 
let w = b'aw' and w' E {a,b}i for some integers < i < k— 1 and j > k. Then 

8'({\},w) = 8'({i+\},aw') 

= 5'({/ + 2,2(/+l) + l,2(/+l)+2,...,?i},w / ) 

= {i + j + 2, 2{i + 1) + j + 1 ,2(i + 1) + j + 2, . . . ,n}. (1) 

Since we already counted the singleton sets and the empty set, we have to count sets of the form (Q} 
having at least two elements. We conclude that the set {/ + j + 2,2(/ + 1) + j + 1,2(/ + 1) + j + 2, . . . ,n} 
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has cardinality at least 2 if and only if we have {2(i + 1) + j + 1,2(7 + 1) + j + 2, . . . ,n} ^ 0, which in 
turn holds if and only if < j < n — 2(i + 1) — 1. So, there are n — 2[i + 1) states reachable for a fixed 
i < k — 1, which gives Y?d=\ n — 2/ states that are reachable by reading words from {ft ! a}{a,fty with 
0<i<Jfc-l. 

Now let w = Maw' for some W G {a, ft} 7 . Then 

«'({l},w) = «'({*+ l},avt/) 

= 5'({fc + 2,n — (m— l),n— (m— 1) + 1, . . . ,n},w ) 

= {fc + 7 + 2,n-(m-l)+j>-(m-l) + 7 + l,...,n}. (2) 

These sets contain at least two elements if and only if < j < m — 1 . Therefore, exactly m sets of the 
form (0) are reachable in A'. Summing up, we get 

k k k 

m + n+l+^n — 2i = a — 1— — 2i + 1 + V n — 2i = a 

(=1 !=0 1=0 

states. 

To see that we have not multiply counted any state, note that there is no reachable set of states that 
satisfies (O and (0: Assume for some integers i,j,j',k with i < k that 

{i + j + 2,2(i+l)+j + l,...,n} = {k + f + 2,n-(m-l)+j',...,n}. 

Then of course i + j = k + f , and by definition of k and m we have 1 <m<n — 2(fc+l) — 1. Finally, we 
derive 

n-(m-l)+f>n-n + 2(k+l) + l+j' = k + k + j' + 4 

>i + k + j'+4 = i + i + j + 4>2(i + l)+j+l 

which is a contradiction to the assumption. □ 

For our last theorem we use the following results presented in lfT8l : For an integer n let k = [|] 
and A n = ({1, . . . ,n},{a,b},8, 1, {«}) be an NFA with transitions 8(q,a) = {q + l,k + 1} if q < k, 
8(q,a) = {q+ 1} if k < q < n, and 8(q,b) = {q+ 1} if q < n and q^k. Then in |[T8l it is shown 
that A n is minimal and that the minimal equivalent DFA has exactly 2("/ 2 ) +1 — 1 states if n is even, 
and 3 ■2^- n+l ^ 2 ^ 1 — 1 states if n is odd. This minimal DFA spans a binary tree on inputs a and b as 
depicted in Figure [2J 

Theorem 13 For all integers n and a such that a = 3 • 2("/ 2 )~ 1 + /3 if n is even and a = 2 ( - n+l ^ 2 + j8 
ifn is odd, with j8 = 2' — I for some integer 1 < i < [^-], there exists an n-state nondeterministic finite 
automaton accepting a finite language over a binary alphabet whose equivalent minimal deterministic 
finite automaton has exactly a states. 

Proof: Let n, a and j8 be as required andx = n+ 1 — log(j6 + 1). We construct a minimal automaton B n a 
adapting A M _i = ({1, . . . ,n — 1}, {a,b}, 8i , \,{n — 1}) from above by taking anew initial state and 
setting the transition function 8 to 5(0, b) = {1}, 5(0, a) = {l,x}, and 8{q,c) = 8\{q,c), for 1 < q < n— 1 
and letter c G {a, ft}. 

Let and be the powerset automata of and B n p. Then, by reading words bw' 
for w' G {a, ft}*, all states of A^_j are reachable in B' n ^. Together with the initial state {0}, these 
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Figure 2: The NFA A n from 1 18] and its powerset automaton that builds a binary tree. In the DFA on the 
right the transitions of states {3} and {3,4} are the same as for {3,5} and {3,4,5}, respectively. 

are 2^"- l ^ 2+1 states if n is odd, and 3-2"/ 2 " 1 states if n is even. For considering words of the form 
w = aw', for w' £ {a, b}* , let k = \' 1 + ] -~\ . Then k+l <x<n and we reach the states 

8'({0},aw) = 8'({l,x},w) = 8'({l},w)u8'({x},w). 

These states differ from the ones in A' n _ l as long as 8'({x},w) ^ 0, and this holds if and only if 
\w\ <n—x. There are 2"~ A+1 — 1 = j3 such words, so there are j3 additional states. 

□ 
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